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nNA gonstract and its use. 
The present invention relates to a new DNA construct for transformation into oilseed 
plants. The DNA construct comprises nucleotide sequences encoding peptides with enzyme 
5 activities necessary for the high-level production and esterification of keto group-containing 
xanthophylls in oilseed plants. 

Background of the invention 

Carotenoids are produced de novo by plants, fungi, algae and some bacteria. A 
number of biosynthetic steps are needed for the biological production of the carotenoids. 
10 There are two chemically different groups of carotenoids, namely carotenes containing only 
carbon and hydrogen molecules and xanthophylls containing oxygen in the molecule in 
addition to carbon and hydrogen. 

The xanthophylls, and particularly astaxanthin (3,3'-dihydroxy-P-P-carotene-4,4'- 
dione), are often colored pigments and are used as such or as anti-oxidants. 
1 5 Carotenes are biological precursors for the production of the oxygen-containing 

xanthophylls. There are two types of enzymes responsible for the introduction of hydroxy 
groups and keto groups into the carotenes, namely hydroxylases and ketolases, respectively. 

The keto group-containing xanthophyll astaxanthin, which has keto and hydroxy 
groups, is biosynthetically produced from beta-carotene. 
20 Large-scale production of xanthophylles from natural sources is at present performed 

by AstaCarotene AB, Gustavsberg, Sweden, by cultivation of the alga Haematococcus 
plwialis for the production of astaxanthin in esterified form. 

It would be desirable to be able to produce keto group-containing xanthophylls 
particularly astaxanthin, in oilseed plants. Oilseed plants have naturally P-carotene 
25 hydroxylases but lack p-carotene C-4-oxygcnase enzymes or ketolases. 

Description of the invention 
The present invention provides DNA constructs enabling and promoting 
production of keto group containing xanthophylls, especially astaxanthin, in oilseed plants, 
such as rape, simflower, soybean and mustard. The DNA construct is transformed into the 
30 oilseed plant cell for expression of a protein or fused protein which has an enzyme activity 
enabling keto group insertion into a carotene or hydroxy carotene for the biosynthetic 
production of a keto group containing xanthophyll, such as cantaxanthin (P,P-carotene-4,4*- 
dione) and/or astaxanthin. Use is thus made of the biosynthetic pathway of the oilseed plant to 


wo 01/20011 


PCT/SE0(W01767 


2 

produce carotenoids. The naturally occulting synthesis of carotenoids involves a number of 
enzymes, namely 1-D-deoxyxylulose 5-phosphate synthase, isopentenyl 
pyrophosphaterdimethylallyl pyrophosphate isomerase, geranylgeranyl pyrophosphate 
synthase, phytoene synthase, phytoene desaturase, zeta^arotene desaturase, lycopene beta- 
5 cyclase, (i-carotene hydroxylase, and P-carotene C-4-oxygenase. Genes coding for peptides 
having these enzymatic activities may be inserted into the DNA construct of the invention, 
one or several per construct, to promote high-level production in the transgenic oilseed plant. 
In case only one enzyme coding gene is ins^ed per plant, two or more plants may be 
sexually interbred to produce plants containing all the desired enzyme activities. 

1 0 Thus, the present invention is directed to a DNA construct comprising in the S ' to 3 ' 

direction of transcription operably linked a promoter region directing transcription to the seed 
of an oilseed plant, a nucleotide sequence coding for at least one peptide with enzyme activity 
necessary for keto group containing xanthophyll production and esterification in an oilseed 
plant and a transcriptional termination region. 

15 In a preferred embodiment of the invention the DNA construct additionally 

comprises between the promoter region and the nucleotide sequence coding for at least one 
peptide with enzyme activity a nucleotide sequence coding for a transit peptide directing the 
translated fusion polypeptide to the chloroplast of the oilseed plant. 

The DNA construct is preferably such that the promoter is a napin promoter, the 

20 peptide with enzyme activity necessary for keto group containing xanthophyll production is 
selected from the group consisting of peptides with 1-D-deoxyxylulose S-phosphate synthase, 
isopentenyl pyrophosphate:dimethylalIyl pyrophosphate isomerase, geranylgeranyl 
pyrophosphate synthase, phytoene synthase, phytoene desaturase, zeta-carotene desaturase, 
lycopene beta-cyclase, P-carotene hydroxylase, and P*carotene C-4-oxygenase activity. To 

25 promote esterification of astaxanthin a nucleotide sequence coding for a peptide with acyl 
transferase activity may be included in the group. 

In a preferred embodiment of the DNA construct according to the invention the 
nucleotide sequence coding for a peptide with enzyme activity is a nucleotide sequence 
coding for a N-terminally truncated P-carotene C-4-oxygenase gene from the alga 

30 Haematococcus pluvialis. 

An example of the DNA construct of the invention is presented in the sequence 
listing as SEQ ID NO:l and in Fig.l. 
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The present invention is also directed to a transgenic oilseed plant cell 
comprising the DNA construct of the mvention, and preferably die oilseed plant is selected 
from the group consisting of rape, sunflower, soybean and mustard. 

The invention is additionally directed to transgenic oilseed plant-produced 
S xanthophyll, e.g. canthaxanthin and astaxantfain. 

A preferred aspect of the mvention is directed to transgenic oilseed plant- 
produced astaxanthin esters. 

The present invention will now be illustrated with refemce to the DNA 
construct disclosed in the sequence listing and in Fig.l, and the following description of 
1 0 embodiments. However, the invention is not limited to these exemplifications. 

Short description of the drawings 
Fig.l illustrates the nucleotide sequence of the DNA construct comprising the wpin promoter, 
the chloroplast localization signal, the N-terminally truncated p-carotenc C-4-oxygenase gene 
and the termination sequence, and the deduced amino acid sequences of the transit pq[>tide 
1 5 and the p-carotene C-4-oxygenase. 

Description of embodiments 
The invention is illustrated by production of astaxanthin in the seed of oilseed 
TBpc. The astaxanthin produced in the seed of the transgmic plant is extracted as part of the 
extracted oil. By use of conventionally used protocols for Agrobacterium tumefaciens 
20 mediated transfomiation such as described by (Hoekema et al.l983. An et al. 1986, Fry et al. 
1987, DeBlock et al. 1988, Radke et al.l988, or Moloney et al. 1989) transgenic plants are 
produced having a chuneric DNA construct that is gmetically inherited and is able to produce 
astaxanthin. The nucleotide sequence of the chimeric DNA constmct consist of four parts of 
different genetic origin namely: (1) a promoter, (2) a localization signal, (3) a P-carotene C-4- 
25 oxygenase coding region and (4) a termination sequence. 

The napin promoter directs transcription to the seed of oilseed rape (StUberg et 
al 1996). This promoter was cotipled to a localization signal similar but not identical to a 
transit peptide (TP) of Rbcsla (Krebbers, 1988) that directs the translated product of a fixscd 
gene to the chloroplast The promoter and the TP sequence were ligated to a part of the coding 
30 sequence of a ketolase gene BCK (Kajiwara et al. 1995). This enzyme oxygenates p-carotene 
to canthaxanthin, (Fraser et al, 1997). The chknmc DNA construct was then coupled to a 
suitable termination sequence, e.g. that of the AgrobactCTium tume&ciens nopaline synthase 
gene (the nos 3' end)(Bevan et al. 1983), as iUustrated in Fig.L 
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Cellular storage of Astaxantin 

The storage of large amounts of free astaxanthin in plants will be difficult due to 
toxic effects of the molecule as it intercalates in the plant membranes. An effective 
esterification of astaxanthin to fatty acids enables storage of the esterified molecules in 
triacylglycerol containing oleosomes. Thus, an acyl transferase can be claimed to be of 
fundamental importance for the process, as is proteins that can mediate transport of different 
forms of astaxanthin from the chloroplast to the vesicles. 

Sequences and oligonucleotides used in the construction of the D NA constmct 
L Napin promoter (GeneBank ACCESSION No. J02798) 

This promoter sequence, a 1 145 base pair fragment including the S* leader 
sequence has a unique Hindin site at the 5* end. The 3' end was synthesized with an 
additionally 6 nucleotide BamHI site. 

2. Transit peptide similar to RBCSla (GeneBank ACCESSION No. XI 3611. XI 4565) 

The transit peptide (TP) was amplified by PGR from -28 to the end of the transit 
cleavage aa=54/55 site of the Rbcsla gene. The 5* end was synthesized with a BamHI site 
and similarly the 3' sequence was synthesized with a Xbal site. The two following 
oligonucleotides were used for the PGR amplification. 

BamHI 

5' primer: TPl 5'AGAG GGATCC TCAGTGAGAGAAAGAGTA 3' 

Sad Xbal 

3 • primer: TP2 5 'GTTC GAGCTC TGTAG A CATGGAGTTAAGGG 3 ' 

J. BCK (fi-carotene C-4 oxygenase) (Genebank ACCESSION No, D45881) 

The BCK fragment was amplified by PGR including a 5* Xbal site and was 
ligated to the TP already described. The 5* primer (BCKl) used for PGR, is homologous to 
the BCK sequence fix)m nucleotide 264 and the 3' oligonucleotide (Ax40) ends with a stop 
codon and was synthesized with a Sad restriction site for cloning. The synthesized fragment 
was fused to the TP as shown in Fig 1 . 
Oligonucleotides used for PGR: 

Xbal 

5 ' primer: BGKl 5 ACAG TCTAGA ATGGCATGCGAGTGGTGA 3 ' 

Sad 

3 primer: AX40 5 G ACCG AGGTGGATGACACTGTTGTGGAGA 3 ' 
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Description of SEQ ID NO:l and SEQ ID NO:2 

The sequences shown i Fig.l are the same as the two sequences which are 
shown in the sequence listing. 

The SEQ ID NO:l is a nucleotide sequence composed of the following features: 

Nucleotide No. 
Cloning site Hindm 1 -6 

Napin Promoter 1-1 145 

Cloning site BamHI 1 146-1 151 

Transit peptide leader 1 1 52-1 1 78 

Transit peptide coding 1 179-1347 

Cloning site Xbal 1348-1353 
P«-carotene C-4-oxygenase 1354-2217 
P-carotene C-4-oxygOTse 3' untranslated 2218-2266 
Cloning site SacI 2267-2272 
Nopaline synthetase termination 2273-2536 
Cloning site EcoRI 2538-2543 


The SEQ ID NO: 2 is a deduced amino acid sequence of the fusion protein of 
the transit peptide and the peptide with p-carotene C-4-oxygenase activity. 
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Claims 

1. A DNA construct comprising in the 5' to 3' direction of transcription operably 
linked a promoter region directing transcription to the seed of an oilseed plant, a nucleotide 
sequence coding for at least one peptide with enzyme activity necessary for keto group 

S containing xanthophyll production and esterification in an oilseed plant and a transcriptional 
temiination region. 

2. The DNA construct according to claim 1 , which between the promoter region 
and the nucleotide sequence coding for at least one peptide with enzyme activity additionally 
comprises a nucleotide sequence coding for a transit peptide directing the translated fusion 

1 0 polypeptide to the chloroplast of the oilseed plant. 

3* The DNA construct according to claim 1 or 2, wherein the promoter is a napin 

promoter, the peptide with enzyme activity necessary for keto group containing xanthophyll 

production and esterification is selected from the group consisting of peptides with, 1-D- 

deoxyxylulose 5-phosphate synthase, isopentenyl pyrophosphateidimethylallyl pyrophosphate 
IS isomerase, geranylgeranyl pyrophosphate synthase, phytoene synthase, phytoene desaturase, 

zeta-carotene desaturase, lycopene beta-cyclase, P-carotene hydroxylase, P-carotene C-4- 

oxygenase, and acyl transferase activity. 

4* The DNA construct according to any one of claims 1-3, wherein the 

nucleotide sequence coding for a peptide with enzyme activity is a nucleotide sequmce 
20 coding for a N-terminally truncated p-carotene C-4-oxygenase gene from the alga 

Haematococcus pluvialis. 

5. The DNA construct according to claim 4, wherein the nucleotide sequence is 
SEQIDNO:!. 

6. Transgenic oilseed plant cell comprising the DNA construct of any one of 

25 claims 1-5 . 

7* Transgenic oilseed plant cell according to claim 6, wherein the oilseed plant is 
selected from the group consisting of rape, sunflower, soybean and mustard. 

8. Transgenic oilseed plant-produced xanthophyll. 

9. Transgenic oilseed plant-produced xanthophyll according to claim 8, wherein 
30 the xanthophyll is canthaxanthin 

10. Transgenic oilseed plant-produced xanthophyll according to claim 8, 
wherein the xanthophyll is astaxanfhin. 

11. Transgenic oilseed plant-produced xanthophyll according to claim 8, 
wherein the xanthophyll is astaxanttiin esters. 
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Napln promoter 

AAGCTTTCTTa^TOKrrGATTGATTCCT^ 

GGCAAGTATTCAGTTACCAGTTACOICTTATATTCTGGACTTTCT 

TTTCCAAOlTTTTAAATTTCACTATTGGCTGAATGCrTCTTC^^ 

CAQATGGO^GAAATGTATCAACaUWTGCATATATAaUUlTGTACCTC^ 

ATCTATCGGATGGTTCCATTTGCTTTGTCATCaUlTTAGTGACTACnr^ 

TCCTCTTTATTACTATTTTCATGCGAGGTTGCaiTGTACATTATATTTG^ 

GCrATTGAGC GU "lT' r i Vr TCAATTTTCTTTATTOT 

GAGTTGGGTTGAATGAGATATACGTTCAAGTGAAGTGGCATACCGTTCTCGAGTAAG^ 

GACCTACCCATTCTTGAGACUUUITGTTACATTTTAGTATCAGAGTAA^ 

AACTCAAATTCGATTGACATGTATCCATTOUICATAAJ^TTAAACCAGCCT 

TCCACATTTCAAGTATTTXaUlACCGTTCGGCTCCTATCaiCCGGGTGTAAC^^ 

TTCCGAATTTGGAAGATTTTGACrrCAAATTCCCAATTTATATTQACCCT 

CTTTAACTTCTATAATTCTGATTAAGCTCCCAATTTATATTCCCAACGGCACTACCT 

AAATTTATAGACTCTa^TCCCCTTTTAAACCAACTTAGTAAACGTT^^ 

TATGAAGTTAAOTTTTTACCrrrGTTTTTAAAJU^GAATCGTO 

ACATTAGCTACACGTTACACATAGCATGCAGCCGCGGAGAATTGT'rTTTCTTCGCCACTT 
GTCACTCCCrrOUUiaiCCTAAQAGCTTCTCTCTCAC^ 

GTGCATGOlTTATTACACGTGATCGCCATGaUUlTCTCCTTTATAGCCTATAAAOT 
CATCCGCTTCACTCTTTACTOUULCCAAAACTC^ 

Snd -28 uzLtranslated leader TP start 

a^CGAGGATCCTCAGTCACAOUUlGAGTAAAGAAGAACAATOGCCT 

M A S S M I« S 

TCCGCTACrrATGGTTGCCTCTCCGGCTaiGGCCACTATGGTCGCTCCTTTCAACGGAC^ 
SATMVASPAQATMVAPFNGI. 

AAGTCCTCCGCrGCCTTCCCAGCCACCCOCAAGGCTAAOUlCGACAra 
KSSAAFPATRKANKDITSIT 

PIG.l 
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TP End C-4 -Oxygenase 

AGOlACGGCGGACGCGTTAACTGCATGTCTAGAATQCa^TCCaAGTCGTCAGACGa^GCT 
SNGGRVNCMSRKPSESSDA A 

CGTCCTGCGCTAAAGCACGCCTACAAACCTCCAGCATCTGACQCCaiAGGGCATCACGATG 
RPAI.KHAYKPPASDAKGZTM 

GCGCTGACCATCATTGGCACCTGGACCGCAGTGTTTTTACACGCAATATrTaUl^ 
ALTIIGTWTAVFLHAIPQIR 

CTACCGACATCCATGGACCAGCTTCACTGGTTGCCTGTGTCCGAAGCO^CAGCCCAGC^ 
LPTSMDQLHWLPVSEATAQI. 

TTGGGCGGAAGOlGCAGCCTACTGCACATCGCTGaAGTCTTCATTGTACTTG^ 
LGGSSSLLKIAAVFIVZiEFI. 

TACACTGGTCTATTCATCACOlCACATGACGaUlTGaiTGGa^CC^TAGCTT^ 
YTGLFITTHDAMH GTIALRH 

AGGCAGCTCAATGATCTCCTTGG<»AaiTCTGCATATaiCTGTACGCCTGGTTTGACT 
RQI.NDIiI.GNICISIiYAWFDY 

AGCATGCTGCATCGCAAGOlCTGGGAGCACCACAACCATACTGGCGAAQrGGGGAAAG 
SMI.HRKHWEH HNHT6EVGKD 

CCTGACTTCCACAAGGGAAATCCCGGCCTTGTCCCCTGGTTCGCCAGCTTCATGTCCAGC 
PDFHKGNPGZ.VPWFASFMSS 

TACATGTCCCTGTGGCAGTTTGCCCGGCTGGCATGGTGGGCAGTGGTGATGCAAATGCTG 
YMS liWQ FAR IiAWWAVVMQMIi 

GGGGCGCCCATGGCAAATCTCCTAGTCTTCATGGCTGCAGCCCCAATCTTGTCAGCATTC 
GAPMANLLVFMAAAPILSAF 

CGCCTCTTCTACTTCGGCACTTACCTGCCACACyU^GCCTGAGCCAGGCCCTGCAGCAGGC 
RliFYFGTYLPHKPEPGPAAG 

TCTCAGGTGATGGCCTGGTTCAGGGCOIAGACAAGTGAGGCATCTGATGTGATGAGTTTC 
SQVMAWFRAKTSEASDVMSF 

CTGACATGCTACCACTTTGACCTGCACTGGGAGCACCACAGATGGCCCTTTGCCCCCTGG 
LTCYHFDLHWEHHRWPFAPW 

C-4 oxygenase Stop 
TGGCAGCTGCCCCACTGCCGCCGCCTGTCCGGGCGTGGCCTGGTGCCTGCCTTGGCATGA 
WQLPHCRRIiSGRGIiVPAIiA* 


FIG.l (cont. ) 


wo 01/20011 


PCT/SEOO/01767 


3/3 

C-4 oxygenase untranslated region Nos term 

CCTGGTCCCTCCGCTGGTGACCCAGCGTCTGCACAAGAGTGTCATGGAGCTCC3AATTTCC 

CCGATCGTTCAAACATTTGGCAATAAAGTTTCrrAAGAOT 
CGATGATTATCATATAATTTCTGTTGAATTACGTTAAGC^^ 

GCATGACGTTATTTATGAGATGGGTTTTTATGATTAGAGTCCCGCAATTATACATO 

ACGCQATAGAAAACAA2UVTATAGCGCGaUUlCTAGGATAAATTATCGCGCG^ 
end 

CTATGTTACTAGATCGGGAATTC 

Fig. 1 (cont • ) 
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SEQXXBNCE LISTING 


<110> AstaCarotiene AB 


<120> DNA construct and its use 

<130> 29295-AstaCarotene 

<140> 
<141> 

<160> 2 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 2543 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: napin promoter 

+ choroplast localization signal + beta-carotene C-4 oxygenase 
coding sequence -i- termination sequence 

<220> 

<221> promoter 
<222> (1)..(114S) 

<220> 

<221> transit peptide 
<222> (1179) . . (1347) 

<220> 

<221> CDS 

<222> (1179) . . (2217) 
<220> 

<221> terminator 

<222> (2273) . . (2536) 

<400> 1 


aagctttctt 

catcggtgat 

tgattccttt 

aaagacttat 

gtttcttatc 

ttgcttctga 

60 

ggcaagtatt 

cagttaccag 

ttaccactta 

tattctggac 

tttctgactg 

catcctcatt 

120 

tttccaacat 

tttaaatttc 

actattggct 

gaatgcttct 

tctttgagga 

agaaacaatt 

180 

cagatggcag 

aaatgtatca 

accaatgcat 

atatacaaat 

gtacctcttg 

ttctcaaaac 

240 

atctatcgga 

tggttccatt 

tgctttgtca 

tccaattagt 

gactacttta 

tattattcac 

300 

tcctctttat 

tactattttc 

atgcgaggtt 

gccatgtaca 

ttatatttgt 

aaggattgac 

360 

gctattgagc 

gtttttcttc 

aattttcttt 

attttagaca 

tgggtatgaa 

atgtgtgtta 

420 

gagttgggtt 

gaatgagata 

tacgttcaag 

tgaagtggca 

taccgttctc 

gagtaaggat 

480 

gacctaccca 

ttcttgagac 

aaatgttaca 

ttttagtatc 

agagtaaaat 

gtgtacctat 

540 
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aactcaaatt cgattgacat gtatccattc aacataaaat taaaccagcc tgcacctgca 600 

tccacatttc aagtattttc aaaccgttcg gctcctatcc accgggtgta acaagacgga 660 

ttccgaattt ggaagatttt gactcaaatt cccaatttat attgaccgtg actaaatcaa 720 

ctttaacttc tataattctg attaagctcc caatttatat tcccaacggc actacctcca 780 

aaatttatag actctcatcc ccttttaaac caacttagta aacgtttttt tttttaattt 840 

tiatgaagtta agtttttacc ttgtttttaa aaagaatcgt tcataagatg ccatgccaga 900 

acattagcta cacgttacac atagcatgca gccgcggaga attgtttttc ttcgccactt 960 

gtcactccct tcaaacacct aagagcttct ctctcacagc acacacatac aatcacatgc 1020 

gtgcatgcat tattacacgt gatcgccatg caaatctcct ttatagccCa taaattaact: 1080 

catccgcttc actcttCact caaaccaaaa ctcatcaata caaacaagac taaaaacatia 1140 

cacgaggatc ctcagtcaca caaagagtaa agaagaaca atg get tec tct atg 1194 

Met Ala Ser Ser Met 
1 5 

etc tct' tec get act atg gtt gcc tct ccg get eag gee act atg gte 1242 
lieu Ser Ser Ala Thr Met Val Ala Ser Pro Ala Gin Ala Thr Met Val 
10 15 20 

get cet ttc aac gga ctt aag tee tec get gee tte cea gcc ace cge 1290 
Ala Pro Phe Asn Gly Leu I*ys Ser Ser Ala Ala Phe Pro Ala Thr Arg 
25 30 35 

aag get aac aac gae att act tec ate aea age aac gge gga cge gtt 133 8 
Lys Ala Asn Asn Asp He Thr Ser He Thr Ser Asn Gly Gly Arg Val 
40 45 50 

aac tgc atg tct aga atg cea tec gag teg tea gae gea get egt cet 1386 
Asn Cys Met Ser Arg Met Pro Ser Glu Ser Ser Asp Ala Ala Arg Pro 
55 60 65 

gcg eta aag cae gcc tac aaa cet cea gea tct gae gee aag gge ate 1434 
Ala l*eu Lys His Ala Tyr Lys Pro Pro Ala Ser Asp Ala I-ys Gly He 
70 75 80 85 

acg atg gcg ctg acc ate att gge acc tgg acc gea gtg ttt tta cac 1482 
Thr Met Ala Leu Thr lie He Gly Thr Trp Thr Ala Val Phe Leu His 
90 95 100 

gea ata ttt caa ate agg eta ccg aca tec atg gae eag ctt cac tgg 1530 
Ala He Phe Gin He Arg Leu Pro Thr Ser Met Asp Gin I*eu His Trp 
105 110 115 

ttg cet gtg tec gaa gcc aca gee eag ctt ttg gge gga age age age 1578 
Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu Gly Gly Ser Ser Ser 
120 125 130 

eta ctg cac ate get gea gtc tte att gta ctt gag ttc ctg tac act 1626 
Leu Leu His He Ala Ala Val Phe He Val Leu Glu Phe Leu Tyr Thr 
135 140 145 
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ggt eta ttc ate acc aca cat gac gca atg cat ggc acc ata get ttg 1G74 
Gly Leu Phe He Thr Thr His Asp Ala Met His Gly Thr He Ala Leu 
150 155 160 165 

agg cae agg cag etc aat gat etc ctt gge aac ate tge ata tea etg 1722 
Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn He Cys He Ser Leu 
170 175 180 

tae gee tgg ttt gac tac age atg etg cat cgc aag cae tgg gag cae 1770 
Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg Lys His Trp Glu His 
185 190 195 

cae aac cat act ggc gaa gtg ggg aaa gac cet gac ttc cae aag gga 1818 
His Asn His Thr Gly Glu Val Gly Lys Asp Pro Asp Phe His Lys Gly 
200 205 210 

aat cec ggc ctt gtc cec tgg ttc gee age ttc atg tec age tac atg 1866 
Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe Met Ser Ser Tyr Met 
215 220 225 

tec etg tgg cag ttt gee egg etg gea tgg tgg gea gtg gtg atg caa 1914 
Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp Ala Val Val Met Gin 
230 235 240 245 

atg etg ggg gcg cec atg gea aat etc eta gtc ttc atg get gea gee 1962 
Met Leu' Gly Ala Pro Met Ala Asn Leu Leu Val Phe Met Ala Ala Ala 
250 255 260 

cca ate ttg tea gea ttc cgc etc ttc tac ttc ggc act tae etg cca 2010 
Pro He Leu Ser Ala Phe Arg Leu Phe Tyr Phe Gly Thr Tyr Leu Pro 
265 270 275 

cae aag ect gag cca ggc cet gea gea gge tet cag gtg atg gee tgg 2058 
His Lys Pro Glu Pro Gly Pro Ala Ala Gly Ser Gin Val Met Ala Trp 
280 285 290 

ttc agg gee aag aca agt gag gea tet gat gtg atg agt ttc etg aca 2106 
Phe Arg Ala Lys Thr Ser Glu Ala Ser Asp Val Met Ser Phe Leu Thr 
295 300 305 

tgc tac cae ttt gac etg eac tgg gag cac cac aga tgg cec ttt gcc 2154 
Cys Tyr His Phe Asp Leu His Trp Glu His His Arg Trp Pro Phe Ala 
310 ' 315 320 325 

cec tgg tgg cag etg cec eac tgc cgc cgc etg tec ggg egt ggc etg 2202 
Pro Trp Trp Gin Leu Pro His Cys Arg Arg Leu Ser Gly Arg Gly Leu 
330 335 340 

gtg ect gcc ttg gea tgaectggtc cetcegctgg tgacccagcg tctgcacaag 2257 
Val Pro Ala Leu Ala 
345 

agtgtcatgg agctcgaatt tecccgateg ttcaaacatt tggcaataaa gtttcttaag 2317 

attgaatcct gttgccggtc ttgcgatgat tatcatataa tttctgttga attaegttaa 2377 

gcatgtaata attaacatgt aatgcatgac gttatttatg agatgggttt ttatgattag 2437 

agtcccgcaa ttatacattt aatacgegat agaaaaeaaa atatagegcg caaactagga 2497 

taaattateg cgcgcggtgt catctatgtt aetagatcgg gaatte 2543 
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<210> 2 
<211> 346 
<212> PRT 

<213> Artificial Sequence 

<223> Description of Artificial Sequence: deduced fusion protein of 

transit peptide + peptide with beta-carotene C-4 oxygenase activity 

<400> 2 

Met Ala Ser Ser Met Leu Ser Ser Ala Tlir Met Val Ala Ser Pro Ala 

15 10 15 

Gin Ala Thr Met Val Ala Pro Phe Asn Gly Leu Lys Ser Ser Ala Ala 
20 25 30 

Phe Pro Ala Thr Arg Lys Ala Asn Asn Asp He Thr Ser He Thr Ser 
35 40 45 

Asn Gly Gly Arg Val Asn Cys Met Ser Arg Met Pro Ser Glu Ser Ser 
50 55 60 

Asp Ala Ala Arg Pro Ala Leu Lys His Ala Tyr Lys Pro Pro Ala Ser 
SS 70 75 80 

Asp Ala Lys Gly He Thr Met Ala Leu Thr He He Gly Thr Trp Thr 
85 90 95 

Ala Val Phe Leu His Ala He Phe Gin He Arg Leu Pro Thr Ser Met 
100 105 110 

Asp Gin Leu His Trp Leu Pro Val Ser Glu Ala Thr Ala Gin Leu Leu 
115 120 125 

Gly Gly Ser Ser Ser Leu Leu His He Ala Ala Val Phe He Val Leu 
130 135 140 

Glu Phe Leu Tyr Thr Gly Leu Phe He Thr Thr His Asp Ala Met His 
145 150 155 160 

Gly Thr He Ala Leu Arg His Arg Gin Leu Asn Asp Leu Leu Gly Asn 
165 170 175 

He Cys He Ser Leu Tyr Ala Trp Phe Asp Tyr Ser Met Leu His Arg 
180 185 190 

Lys His Trp Glu His His Asn His Thr Gly Glu Val Gly Lys Asp Pro 
195 200 205 

Asp Phe His Lys Gly Asn Pro Gly Leu Val Pro Trp Phe Ala Ser Phe 
210 215 220 

Met Ser Ser Tyr Met Ser Leu Trp Gin Phe Ala Arg Leu Ala Trp Trp 
225 230 235 240 

Ala Val Val Met Gin Met Leu Gly Ala Pro Met Ala Asn Leu Leu Val 
245 250 255 
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Phe Met: AXa Ala Ala Pro He Leu 
260 

Gly Thr Tyx Leu Pro His Lys Pro 

275 260 

Gin Val Met Ala Trp Phe Arg Ala 
290 295 

Met Ser Phe Leu Thr Cys Tyr His 
305 310 

Arg Trp Pro Phe Ala Pro Trp Trp 
325 

Ser Gly Arg Gly lieu Val Pro Ala 
340 


Ser Ala Phe Arg Leu Phe Tyr Phe 
265 270 

Qlu Pro Gly Pro Ala Ala Gly Ser 
285 

Lys Thr Ser Glu Ala Ser Asp Val 
300 

Phe Asp Leu His Trp Glu His His 
315 320 

Gin Leu Pro His Cys Arg Arg Leu 
330 335 

Leu Ala 
345 
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